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Q ' Abstract 

A goodness of fit test for the drift coefficient of an ergodic diffusion 
process is presented. The test is based on the score marked empirical 
process. The weak convergence of the proposed test statistic is studied 
under the null hypotheses and it is proved that the limit process is a 
continuous Gaussian process. The structure of its covariance function 
allows to calculate the limit distribution and it turns out that it is a 
function of a standard Brownian motion and so exact reject regions 
can be constructed. The proposed test is asymptotically distribution 
free and it is consistent under any simple fixed alternative. 
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1 Introduction 

Let X be an ergodic diffusion process on R, solution of a stochastic dif- 
ferential equation, that is a strong Markov process with continuous sample 
paths which satisfies dX t = S(X t )dt + a(X t )dW t , for t > 0, with some ran- 
dom initial value X . We consider the nonparametric problem of testing 
the unknown drift coefficient S on the basis of the continuous observation 
X T = {X t : < t < T}. We present a goodness of fit test for the drift 
coefficient of a diffusion process based on the statistic sup^, |Vr(x)| where 



As usual 1a denote the indicator function on a set A. Following Koul and 
Stute (1999), we call Vr the score marked empirical process. We prove that 
the test based on the statistic sup^, |Vr(x)| is asymptotically distribution free 
under each simple null hypothesis S — So and it is consistent under any 
simple fixed alternative S = Si ^ S . 

Despite the fact of their importance in applications, few works are devoted 
to the goodness of fit test for diffusions up to our knowledge. So the con- 
struction of goodness of fit tests for such kind of model is very important and 
needs very detailed studies. Kutoyants (2004) discusses some possibilities of 
the construction of such tests. In particular, he considers the Kolmogorov- 
Smirnov statistics A T (X T ) = sup x \fT\F T {x) — F So (x)\. The goodness of fit 
test based on this statistics is asymptotically consistent and the asymptotic 
distribution under the null hypothesis follows from the weak convergence 
of the empirical process to a suitable Gaussian process (Negri, 1998 and 
Van der Vaart and Van Zanten, 2005). However, due to the structure of 
the covariance of the limit process, the Kolmogorov-Smirnov statistics is not 
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asymptotically distribution free in diffusion process models. The statistic 
proposed here does not have this problem. 

We study the weak convergence of the score marked empirical process 
under the null hypotheses and we prove that the limit process is a continu- 
ous Gaussian process. The structure of its covariance function allows us to 
calculate the limit distribution of the proposed statistic. It turns out that it 
is a functional of a standard Brownian motion with known distribution and 
so we can construct exact reject regions. 

Koul and Stute (1999) proposed such kind of statistics based on a class of 
empirical process constructed on certain residuals to check some parametric 
models for time series. They studied their large sample behavior under the 
null hypotheses and present a martingale transformation of the underlying 
process that makes tests based on it asymptotically distribution free. Some 
considerations on consistency have also been done. For the same model 
studied here the problem of testing different parametric form of the drift 
coefficient is well developed (see for example Lin'kov, 1981, Kutoyants, 2004 
and references therein). 

The work is organized as follows. In the next section we present the model, 
its properties and some general conditions and assumptions used through all 
the text. In Section[3]we prove the weak convergence of the proposed statistic 
in a more general context. This result is interesting by itself. Section H] is 
devoted to the presentation of the test problem and to the study of the 
proposed statistic under the null hypothesis. Finally in Section [5] we study 
the behavior of the statistics under the alternative hypothesis and we prove 
that the test is consistent. 
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2 Preliminaries 



Given a general stochastic basis, that is, a probability space (Q,A, P) and 
a filtration {^4i}t>o of ^4, let us consider a one dimensional diffusion process 
solution of the following stochastic differential equation 



where {Wt '■ t > 0} is a standard Wiener process, and the initial value 
X = £ is independent of W t , t > 0. The drift coefficient S will be supposed 
unknown to the observer and the diffusion coefficient a 2 will be a known 
positive function. Let us introduce the following condition. 

£S. The function S is locally bounded, the function a 2 is continuous 
and bounded and for some constant A > 0, the condition xS(x) + a(x) 2 < 
A(l + x 2 ), xeR, holds. 

Under condition £ S the equation (JTJ) has an unique weak solution (see 
Durrett, 1996, p. 210). The scale function of a diffusion process solution of 
the stochastic differential equation (JTJ is defined by 



The speed measure of the diffusion process ([T]) is defined by m>s(dx) = 
-, J 7? \ dx. Let us introduce the following condition: 
1ZV. The scale function is such that 




(1) 




lim p(x) = ±oo 



and the speed measure rris is finite. 
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If the condition 1ZV is satisfied then the process {X t : t > 0}, weak 
solution of (CQ), has the ergodic property (see for example Gikhman and 
Skorohod, 1972 or Durrett, 1996), that is, there exists an unique invariant 
probability measure us such that for every measurable function g e C\{ns) 
we have with probability one, 



1 ' T 



o 



Moreover the invariant measure /is has a density given by 



ms{R)a{y) 2 { J a{v) 2 



where 



is finite. 



m 5 (R) = [°° -^exp (2 f dv) dx 

J-00 v{x) 2 I Jo a ( v ) J 



3 A limit theorem 

In this section we present a theorem on the weak convergence of a stochastic 
process that is interesting by itself. Let us consider a diffusion process X = 
{X t : t > 0} on an open interval ICR, that is a strong Markow process 
with continuous sample paths taking values on /, not necessarily solution of a 
differential stochastic equation of type (pQ). Assume that X is regular, which 
implies that the scale function p and the speed measure m of the diffusion are 
well defined (see Rogers and Williams, 2000). Under the assumption that the 
speed measure m is finite, and denoting by /i the normalized speed measure, 
fj, = it follows that the diffusion process X is positive recurrent and it 
has the ergodic property, with fi as invariant measure. For every x G / and 



5 



t > 0, the local time for the diffusion X in the point x at time t is denoted 
Lf(x). The process {Lf(x) : x £ I,t > 0}, is countinuous and increasing in 
t and continuous and nonnegative in x. The main theorem for local time is 
the occupation time formula. For a diffusion process it can be written as 



for every measurable function h : / — > R. Let us denote the diffusion local 
time for X with respect to the speed measure (see Van der Vaart and Van 
Zanten, 2005 and ltd and McKean, 1965) as lf(x) = L?(x)p'(x). The ran- 
dom function x — > lf(x) can be chosen continuos and has compact support. 
If the measure m is finite then it holds that 



See theorem 4.2 of Van der Vaart and Van Zanten (2005) and also Van Zanten 



Let a standard Wiener process W be given on the same stochastic basis 
where X is defined. Let us consider the process M = {M t (ip) : t > 0, ip E J 7 } 
defined by 



where ip belongs to a countable class T of elements of C 2 (I ,m(dx)). To 
measure the distance between functions in T we use the semimetric p 



For every ip G T the process M(ip) = {M t (ift) : t > 0} is a continuous 
local martingale. Following Nishiyama (1999), definition 2.1, the quadratic 






(3) 



(2003). 
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/9-modulus for the process V = {V t (ip) : t > 0,if) € J 7 }, where V t (i^) 
■^M t (ip), is defined as 



y/\ (M(f) - M(<p)) t 



P ,t = sup — r 



Here (M) = {(M) t : t > 0} denotes the quadratic variation process of a 
continuous local martingales M. 

Let us denote with N(e, J 7 , p) the smallest number of closed balls, with 
p-radius e > 0, which cover the set T . 

Theorem 1. Let T C £ 2 (I,m(dx)) be countable. Then for all 5, K and 
t > it holds that 

r 5 

E sup \V t y>)-V t (<p)\l . <cK v/log7V(e,^,p)de, 

pW>,<p)<s J 

where c > is an universal constant and {£ t '■ t > 0} is a stochastic process 
which satisfies £t = Op(l), as t goes to infinity. 



Proof. We have 



1 



(vm t = - / HXsYds 



and from the occupation formula (J2J) for the diffusion local time of a diffusion 
process we can write 



- [ ifj{X s ) 2 ds = - I \p(x) 2 l$(x)m(dx). 
Uo ^ J I 



So we have 



1 



|V|| p ,t = sup — - < J-supZf (x). 
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Now recalling ([3]) the result follows from the theorem 2.3 in Nishiyama (1999) 
if we pose & = ^\ sup^ lf(x). □ 

Let us denote with £°°(JF) the space of bounded functions Z : J 7 — > R 
equipped with the uniform norm ||^||oo = su P v e^ |iT(</?)|. 

On the space we introduce the Gaussian process {T(tjj) : ip G J 7 } 

with mean zero and covariance function given by 




Theorem 2. Let J 7 C £ 2 (J, m(dx)) 6e countable and A/log iV(e, JF, p)de be 
finite. Then the family of stochastic maps {V t (ip) : ip G J 7 } weakly converges 
on the space I 00 {J 7 ), as t goes to infinity, to the Gaussian process {T(ip) : 
^eJ 7 }. 

Proof. The convergence of the finite dimensional laws (Vt(il>i), ■ ■ ■ Vt(ipk)) to 
the law of (T(ipi), . . . , T (■?/>£.)), for every finite k follows from the central limit 
theorem for stochastic integrals (see Kutoyants, 2004). The tightness fol- 
lows from Theorem [TJ assuming that Jq 1 -\/log N(e, J 7 , p)de is finite (see also 
Nishiyama, 2000). □ 



4 Goodness of fit test 

Let us introduce our testing problem. Suppose that we observe the process 
{X t : < t < T}, solution of the stochastic differential equation (TjQ) and we 
wish to test the two different simple hypotheses 

H : S = Sq 
Hi : S = Si, 
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where Si 7^ S means 

sup \Si(x) — So(x)\ > 0. 

We suppose that S and Si belong to the class S a defined for a fixed function 
a as 

S a = {S : conditions £S and VSP are fulfilled} . 

For every iGl, let us introduce the score marked empirical process 

Vt(x) = -±=j l^^X^dX, - S (X,)dt) 

The process {Vr(x) : x G R} takes values in Cb(R), the space of the contin- 
uous bounded function on KL Let us introduce in this space the cr-algebra 
of borel set B generated by the open sets of C B (1R) induced by the norm 
11/11 = ™ VxeR \f{x)\ for every / G C B (R). 

Let us introduce our test procedure. Fix a number e G (0, 1) and let 
us consider the class of asymptotic test of level 1 — e or size e. Given any 
statistical decision function 4> T = <Pt(X t ), the expected value of 4>t(X t ) is 
the probability to reject H having the observation X T = {X t : < t < 
T}. Let us denote by the mathematical expectation with respect to the 
measures induced by the process {X t : < t < T} in the space C[0,T] 
(the space of all the continuos functions on [0, T]). We define the class of all 
the test of asymptotic level 1 — e as 

K e = [fa ■ limsu P E£ o T (X T ) < el . 

The power function of the test based on <pT is the probability of the true 
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decision under Hi, and is given by 

(3 t ^ T )=B T Si MX T ). 

A test procedure is consistent if 

r 5m o E^^ r (X r ) = l. 

Let us introduce in the space (Cb(M), B) the Gaussian process {T(x) :i£l} 
with mean zero and covariance function given by G(x, y) = gg (x Ay), where 

/+oo 
1 { - oo , z] (y) a (y) 2 fs (y)^y 
-00 

Note that swp z g So (z) is finite if a 2 is bounded. Since the function g So is 
nondecreasing and nonnegative the limit process {r(x) :i6l} admits the 
following representation in distribution 

T(x) = B(g So (x)) (4) 

for every igK, where B denote a standard Brownian motion on the positive 
real line. The weak convergence of the process {Vr(x) : x G M} to the process 
{r(x) : x G M} in the space (Cb(R),£>) is immediate if we apply Theorem 
[5] to the functions if)(y) = lt- 0OjX ](y)a(y). Here we remark that the class T 
in Theorem [2] has to be countable. However the process {Vr{x) : x G M} 
is continuous in x, so in the current situation we can consider such class of 
functions. This result on the weak convergence of process Vr, the continuous 
mapping theorem and the representation (Hj) yield the following relations in 
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distribution 



sup |Vt(x)| sup 



sup \B(t)\ = g So (oo) sup \B(t)\ 

0<*<fl! (oo) 0<t<l 



where <7§ (oo) = f R <j(y) 2 fs (y)dy is finite, and =>- denote the weak conver- 
gence as T goes to infinity. We will consider the following statistical decision 
function 



So we have proved that §* T G K. and that the test is asymptotically distribu- 
tion free. To be the introduced statistical procedure useful we have to study 
the asymptotic properties of the statistics sup^. |Vr(x)| under the alternative 
hypotheses. This is done in the next section. 

5 Consistency of the test 

This section is devoted to the study of the asymptotic beaviour of the test 
statitistic sup |Vr(x)| under the alternative hypotheses. We have shown 
in the previous section that the proposed statistics is asymptotically distri- 
bution free. Now we prove that the proposed test procedure is also consistent 
under very general condition on the model. Let us introduce the following 
condition. 




where the critical value c £ is defined by 
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C: For some x G R it holds 



J l^WSoiv) - S 1 (y))f Sl (y)^y * 0- 

— oo 

Theorem 3. Let So and S\ belong to S a and the condition C be satisfied. 
Then the test based on the statistical decision function 



{ sso<.°°) su P*g«l VT ( x )l >Cg } 



is consistent. 



Proof. To prove the consistency it is enough to show that, under Hi 



P ( lim sup |Vt(x)| = +00 ) = 1. 



We can write 



sup |Vr(a:)| > v^Tsup — sup |V^(:r)|, 



where Vp(x) and At{x) are given as follows. Under H\, by Theorem [2] the 
process 

V&x) = ^=£ l^XMdXt - Si(X t )dt) 

weakly converges to the corresponding Gaussian process so the limit process 
is tight. On the other hand, 

A T (x) = ±£ l { _^ x] (X t )(S (X t ) - Si(X t ))dt 
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converges a.s. uniformly in x to 

+00 

Mx) = J l ( ^ x] (y)(S (y) - S 1 (y))f Sl (y)<iy- 

—00 

If the condition A is satisfied we have 

lim VTsup \At(x) I = +00 a.s. 
and the test is consistent. □ 
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